Method, system and apparatus for providing stateful information redaction

ABSTRACT

A system ( 101 ) for implementing redaction rules in compliance with an organization&#39;s privacy policy, where the system intercepts messages between an information source ( 103 ) and an information destination ( 102 ), modifies the message contents based on redaction rules ( 106 ) and forwards the redacted contents over to the client. The system also maintains a record of the redacted information and updates the contents of any message submitted by the client ( 102 ) in order to maintain database integrity.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority to U.S. Non-Provisional Application No. 11/425,524 entitled “Method and System for Providing Granular Data Access Control for Server-Client Applications,” filed by the same first named inventor on Jun. 21, 2006.

BACKGROUND OF THE INVENTION

Securing access to enterprise resources is a balancing act between usability and control. It requires vigilance, persistence, care, and effort. The process starts with risk and vulnerability assessment of the enterprise's assets followed by the security policy definition. When business needs require dispensing data to the Internet and sharing information with partner networks, a unique set of security challenges that cannot be solved by the traditional solutions of firewalls and virtual private networks is presented. In addition to other characteristics, enterprise security policies determine what resources must be available, to whom, and under what circumstances. A related concern deals with the privacy matters, where more information might be dispensed by the computer systems than necessary. In some cases, there may not be sufficient control over such data, which might be supplied by third party providers. In other cases, it becomes costly to alter the systems without significant time and money.

A typical scenario would be a call center setting. The call center employees view and process data provided by third party institutions that include health care and financial institutions. In most cases, the data includes very sensitive information such as social security numbers and credit card numbers. The call center employees do not need such information for the purpose of performing their daily duties. However, having such information available makes it susceptible to theft and can lead to identity fraud.

It is desirable to have control over such data, where the information could be dispensed on an as-needed basis. Ideally, it needs to be controlled based on the role of the user, as well as the source of the data. Such a policy is referred to as “redaction rule” in the context of this document and the process of removing such information is referred to as information redaction. Enforcement of these redaction rules is extremely desirable but no easy implementation exists without a massive investment in resources, that ends up altering the original system.

In order for such a system to be effective, the information redaction must take place as close to the information source as possible. Some prior art systems try to perform redaction close to the destination. While such a solution would work for ordinary users, they do not protect from the savvy identity thieves. In Patent Application Publication Number US 2004/0015729 A1 by Elms et al, published Jan. 22, 2004, the system relies on detecting a client's presence and redacting information from the display. In this scenario, the information has already made its way to the computer system and is simply being obscured from the viewer. A packet sniffer running on the system will easily compile and reveal this information. As an option, Elms et al provide a mediator that is configured through a browser. The mediator provides the advantage of redacting information before it makes it to the computer but it can easily be bypassed by anyone with the knowledge of configuring a proxy. Such an action would result in full display of the information. Finally, the cited solution is limited to systems that only display information through a browser and is not applicable to either machine-to-machine information exchange where machines exchange information using legacy or modern protocols or to terminal emulation programs and thick client desktop applications.

It is desirable to have a cost effective, easily configurable system that enables granular redaction control over multiple applications. Prior art access controls do not provide sufficient granularity and also do not work across multiple application protocols. Accordingly, a new data access control methodology and system is needed that must (a) provide information redaction based on a client's role and application being accessed (b) redact information in transit close to the origination source without requiring application alteration and (c) work across multiple application protocols.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows the deployment of the redaction appliance within a network and the redaction process flow;

FIG. 2 extends the representation of FIG. 1 to show the handling of multiple protocols;

FIG. 3 explains the redaction process as information travels across the network;

FIG. 4 shows the benefit to the institutions deploying the redaction appliances.

FIG. 5 shows the contents of an HL7 protocol based message before and after the redaction appliance.

FIG. 6 shows the contents of an HTTP protocol based message before and after the redaction appliance.

SUMMARY OF THE INVENTION

A system that is capable of intercepting, parsing, and reconfiguring networked application messages as they flow between an information source and destination. The former is an application serving data and latter is a client machine, process, appliance, software system or another application. The system is a network appliance, based on embedded technology, with multiple physical interfaces. It contains technology for interfacing with an external redaction rules engine and a client role identification service. The system identifies the client associated with a network message, obtains the redaction rules that would apply to such a message, parses the data to detect if it contains any information that violates the redaction rules, and replaces such information with benign contents while maintaining the integrity of the message.

In an alternative embodiment of the invention, in addition to above, the redacted contents are also maintained within the redaction appliance. If the client tries to update the original information with modifications, the redacted contents are replaced within the message to maintain the integrity of the data within the application server database.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT

The invention explained herein is called a redaction appliance. The preferred embodiment comprises an embedded computer based on a standard operating system such as Linux. The kernel is compiled with bare minimum options in order to harden it and minimize any attacks against it. The operating system is also assembled from scratch in order to provide the highest performance with a minimal amount of services running on it. The appliance has a minimum of two logical network connections, which in specific embodiments can also be implemented as physical connections, where one of the connections services the clients or end users and the others connect to the applications. The appliance is placed as close to the information source as possible. It runs specialized processes that are capable of intercepting the communications to and from the information source, parsing it and detecting any information that could be deemed as sensitive by government regulations or by the company's own privacy policies. Such sensitive information is seamlessly replaced with masked information that does not violate the privacy policies, yet maintains the integrity of the message and the protocol carrying the message. The masking of such information can be performed in a uniform manner for all the recipients, but the preferred embodiment associates the client with the request and performs the masking based on the client's role within the organization.

FIG. 1 shows the deployment of the redaction appliance in a simplified setting. The appliance (101) is deployed within the network where it intercepts traffic from the internal application servers (103) as well as external application servers (104), as it flows towards the client machine (102). The distinction between the two types of application servers is important with regards to the appliance's benefits, even though the process of redaction is not affected. In case of internal application servers, the deploying organization may have control over the application and could utilize redaction through alternate means, such as application reconfiguration or writing new source code. In case of external application servers, the data is served by external content or application providers, which allows no control to the deployment organization, thus immensely increasing their liability in the absence of this invention. The redaction appliance interacts with two external databases in order to perform its functions. The first is a redaction rules engine (105) and the second is a user and role identifier database (106). The user and role identifier database belongs to the deploying organization and minimally contains user names, their authentication credentials and their roles within the organization. The database can reside within an LDAP or SQL server, although other options are also possible. The redaction rules are defined for the appliance usage by a system administrator and reside within an LDAP server. Only one such database needs to be defined even if multiple applications and redaction appliances are involved.

FIG. 1 also shows the flow of the redaction process. The process starts with the user issuing a request for information. Such a request is directed at the application server. It is intercepted by the redaction appliance and the complete request is assembled. The request is forwarded to the server and a response obtained as a result of this request. While FIG. 1 shows interaction with the internal application servers (103), the case is identical for communicating with external application servers (104). The appliance configuration options define the manner of redirection of the request. The appliance associates the request and response with the specific client by examining the client identification information. In one scenario, such identification information will be contained within an authentication request that could be directed at the same application server or a centralized authentication server. In an alternate scenario, the authentication information would be contained within the request for information. Once the authentication process succeeds, the redaction appliance uses the said identification information to obtain the client role(s) from the role database (106). The corresponding redaction rules for this role are retrieved from the rules database (105) and stored within the appliance for the remainder of the session. The appliance applies the redaction rules and masks out part of the information. This information is not conveyed to the client and hence not displayed at the information destination.

In an alternate embodiment, that is more suitable for database applications where the client requests a single or plurality of records, the redaction appliance not only redacts the information but also keeps track of the original state of the records returned by the server by associating them with the client session before returning them to the client. More specifically, the redacted information is maintained by the redaction appliance. As is usually the case with database applications, the client may subsequently modify the single or plurality of records it has received and submit all or subset of those to the server within a subsequent request for update. The redacted information is re-inserted within that request by the said redaction appliance before the request is forwarded to the server, so that the database integrity is not compromised by partially redacted records.

FIG. 2 shows a more typical deployment of the simplistic case depicted in FIG. 1. The redaction appliances detect, parse and process information at the application protocol level. In the preferred embodiment, each appliance focuses on one application protocol in order to keep the network high performance and not experience any perceptible delays. In an alternate embodiment, a single appliance can handle multiple application protocols. Some of the network protocols that the appliances work with include HTTP, LDAP, 3270/5250 over Telnet and TDS, although the system architecture supports all application protocols by deploying the appropriate protocol parser. FIG. 2 shows the use of one redaction appliance per protocol. Even though multiple appliances are deployed, they all use the same user database and redaction rules database.

FIG. 3 provides a step-by-step flow of the redaction process that has been partially discussed above during the description of FIG. 1. The redaction process starts when the redaction appliance detects a client request and the corresponding server response (301). The appliance identifies the client making the request using the authentication information embedded within the request. Once the client has been identified, the client's role is retrieved from the role database (106). The redaction rules associated with this role are retrieved from the redaction rules database (105) and associated with this client's session. This process is carried out only once during a single client session. All subsequent server responses directed to this client are subjected to the redaction rules applicable to his or her roles (302). These responses are parsed by the protocol parser for the application protocol in use for that session (303). As an example, the protocol can be HTTP or 3270 terminal emulation over telnet. Parsing the message stream allows the redaction appliance to determine if any of the redaction rules need to be enforced. On making such a determination, the appliance makes a temporary copy of the sensitive information (304) and replaces it with a predetermined character, such as Xs or blanks (305). The client does not get to see the redacted information, yet proceeds with his required duties, such as information updates. Once the client submits an update, the appliance intercepts the updated message (306) and restores the redacted contents at the appropriate place(s) within the message (307). The resulting message is submitted to the server in order to complete the process (308). It is important to note that the redaction rules will be enforced differently for different clients if they carry different organizational roles. A specific client may see three information items redacted, while another one may see only one such item redacted.

FIG. 4 presents the benefit of this system from the deploying organization's point of view. This is an example of how the background process flow affects a user working on the system. A person has called into a call center to update his phone number. The server sends a message that contains this callers information, including his social security number and credit card number (401). Upon receiving the servers message, the redaction appliance intercepts it before it makes its way to the call center employee's screen. The redaction appliance applies the redaction rules and masks out all but the last four digits of the social security number and the credit card number (402). The call center employee updates the phone number and submits the information back to the server (403). This message is intercepted by the redaction appliance, which restores the full credit card number and social security number (404) before sending the message back to the server. It should be noted that the redaction appliance works at the message level, where each message is composed of a multitude of network packets. The redaction rules are applied to the message as a whole and not to the individual packets.

FIG. 5 and FIG. 6 provide a behind the scenes look at the message as it travels past the redaction appliance. An identical set of tasks is performed in both of these cases with one major difference—the protocol parser being applied in each case. Two different protocols have been illustrated in order to emphasize that the redaction process works in a protocol independent manner.

FIG. 5 shows a message from a legacy version of the HL7 protocol, that is being used to communicate a patient's medical information. The top section of the figure shows the original message that is received at the redaction appliance (101) from the application server (103). In this case, the redaction appliance is capable of parsing the HL7 protocol and analyzing its contents. The content parser discovers that the message contains the patient's full social security number and test results (highlighted in gray). The privacy policy for this specific viewers role only allows him to view the last four digits of the social security number and does not allow for the viewing of the test results. The redaction appliance replaces the first five digits of the social security number with the letter X and does the same for the test result values. The resulting message is shown in the bottom portion of FIG. 5 and is forwarded to the viewing client (102).

FIG. 6 shows a different message parser at work, this time for the HTTP protocol. The top portion of the figure shows an HTTP message with HTTP headers followed by an XML document envelop. This message is received by the redaction appliance (101) from an external application server (104). The redaction appliance contains an HTTP protocol parser that locates a patient's social security number and test results within the message. Following the corporate privacy policy, the redaction appliance replaces all but the last four letters of the social security number with the letter X, and does the same for the test result values. This transforms the message into the redacted message shown at the bottom of the FIG. 6. The message is sent over to the client (102) for viewing after that. 

1. A method of enhancing information privacy and confidentiality wherein a specific protocol request is received from a client for an application accepting the same protocol and the request from the said client, as well as the corresponding response from the said application is intercepted by a redaction appliance, further comprising: a. associating the said request with a session object that is maintained by the said redaction appliance; b. forwarding the said request to the said application and intercepting the response; c. applying the redaction rules associated with the session object to the response; d. redacting the information in the response based on the redaction rules while maintaining the integrity of the said protocol and the message contained within; and e. returning the modified response to the requesting client; whereby the returned response does not contain any sensitive information that the client is not authorized to receive.
 2. The method of enhancing information privacy and confidentiality of claim 1 wherein the message received from the said client is the first such message received that also contains the identification information of the said client and further including: a. verifying the said client's identity and then roles from a client and role identifier; b. obtaining the redaction rules for roles associated with the said positively identified client and roles from a redaction rules database; c. creating a session object, associating the rules with the session object and maintaining that object at the redaction appliance for purpose of using it for this and subsequent requests from the same said client and the same said application;
 3. The method of enhancing information privacy and confidentiality of claim 1, wherein the client is a user using a user interface such as a browser.
 4. The method of enhancing information privacy and confidentiality of claim 1, wherein the client is a machine or a process.
 5. The method of enhancing information privacy and confidentiality of claim 1 wherein the singular or plurality of the said applications are external to the organization.
 6. The method of enhancing information privacy and confidentiality of claim 1 wherein the singular or plurality of the said applications are internal to the organization.
 7. The method of enhancing information privacy and confidentiality of claim 1 wherein one or more of the said applications are internal to the organization and remainder of the one or more of the said applications are external to the organization.
 8. The method of enhancing information privacy and confidentiality of claim 1 wherein the data from the singular or plurality of the said internal or external applications is encrypted and the method further includes decryption of information before redaction and re-encrypting it before forwarding it to the said client.
 9. The method of enhancing information privacy and confidentiality of claim 1 wherein a separate redaction appliance is used for every distinct protocol.
 10. A method of enhancing information privacy and confidentiality wherein a specific protocol request is received from a client for a database application accepting the same protocol and the request from the said client, as well as the corresponding response from the said database application is intercepted by a redaction appliance, further comprising: a. associating the said request with a session object that is maintained by the said redaction appliance; b. forwarding the said request to the said database application and intercepting the response; c. applying the redaction rules associated with the session object to the response; d. redacting the information in the response based on the redaction rules while keeping the integrity of the said protocol and the message contained within; and e. returning the modified response to the requesting client; f. retaining the entire state of all data records received from the said database application within the response locally at the said redaction appliance before returning it to the client; more specifically, saving the information that has been redacted; g. waiting for a subsequent request from the client containing the one or plurality of the same records whose information that was not redacted has been updated by the client; h. re-inserting the information that was redacted earlier into the received said request; and i. forwarding the request with reconstructed complete records to the said database application. whereby the integrity of the information is retained without having to modify the database or having to exposing sensitive information to an unauthorized client.
 11. The method of enhancing information privacy and confidentiality of claim 10 wherein the message received from the said client is the first such message received that also contains the identification information of the said client and further including: a. verifying the said client's identity and then roles from a client and role identifier, b. obtaining the redaction rules for roles associated with the said positively identified client and roles from a redaction rules database; c. creating a session object and maintaining that object at the redaction appliance for purpose of using it for subsequent requests from the same said client and the same said database application; whereby the integrity of the information is retained without having to modify the database or having to exposing sensitive information to an unauthorized client.
 12. The method of enhancing information privacy and confidentiality of claim 10 wherein the data from the singular or plurality of the said internal or external applications is encrypted and the method further includes decryption of information before redaction and re-encrypting it before forwarding it to the said client.
 13. The method of enhancing information privacy and confidentiality of claim 10 wherein a separate redaction appliance is used for every distinct protocol.
 14. A machine for redacting sensitive data from responses from an application to requests made by a plurality of clients and hence enforcing established enhanced information privacy and confidentiality policy comprising: a. a redaction rules database; b. session object database for maintaining distinct session objects for each of the said clients; c. redaction rules enforcing the system to enforce the redaction rules;
 15. The machine for redacting sensitive data of claim 14 further including: a. a protocol parser to extract the identifying information of each of the said clients from within the received protocol messages; b. a subsystem to create session objects based on roles returned by the User Roles Identifier and redaction Rules;
 16. The machine for redacting sensitive data of claim 14 wherein the applications are database applications and further including: a. a database of records for temporarily storing the database records that are redacted and are associated with respective session objects; b. a record reconstructing subsystem to reconstruct complete records from the modified records received from said clients that were redacted earlier; c. a subsystem to update the messages using the said reconstructed records;
 17. The machine for redacting sensitive data of claim 14, wherein the machine is capable of interacting with any user-interface used by a user including a browser, a terminal emulation client or a thick client application.
 18. The machine for redacting sensitive data of claim 14, wherein the said machine is capable of interacting with a client that is a machine or a process.
 19. The machine for redacting sensitive data of claim 14, further including; a. a decryption subsystem to decrypt the requests received from the said clients and to responses received from the said applications before redaction; and b. an encryption subsystem to re-encrypt the requests to send them to the said applications and responses after redaction to send them to the said clients.
 20. The machine for redacting sensitive data of claim 14, wherein a separate redaction appliance is used for every distinct protocol. 